Overview

Dataset statistics

Number of variables16
Number of observations558837
Missing cells123494
Missing cells (%)1.4%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory418.1 MiB
Average record size in memory784.4 B

Variable types

Numeric5
Text8
Categorical3

Alerts

transmission is highly imbalanced (90.1%)Imbalance
interior is highly imbalanced (50.4%)Imbalance
make has 10301 (1.8%) missing valuesMissing
model has 10399 (1.9%) missing valuesMissing
trim has 10715 (1.9%) missing valuesMissing
body has 13195 (2.4%) missing valuesMissing
transmission has 65321 (11.7%) missing valuesMissing
condition has 11820 (2.1%) missing valuesMissing

Reproduction

Analysis started2024-09-11 11:32:15.652332
Analysis finished2024-09-11 11:32:29.992671
Duration14.34 seconds
Software versionydata-profiling v0.0.dev0
Download configurationconfig.json

Variables

year
Real number (ℝ)

Distinct34
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2010.0389
Minimum1982
Maximum2015
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.3 MiB
2024-09-11T15:32:30.021170image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum1982
5-th percentile2002
Q12007
median2012
Q32013
95-th percentile2014
Maximum2015
Range33
Interquartile range (IQR)6

Descriptive statistics

Standard deviation3.9668636
Coefficient of variation (CV)0.0019735258
Kurtosis1.0105063
Mean2010.0389
Median Absolute Deviation (MAD)2
Skewness-1.1832259
Sum1.1232841 × 109
Variance15.736007
MonotonicityNot monotonic
2024-09-11T15:32:30.081672image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=34)
ValueCountFrequency (%)
2012 102315
18.3%
2013 98168
17.6%
2014 81070
14.5%
2011 48548
8.7%
2008 31502
 
5.6%
2007 30845
 
5.5%
2006 26913
 
4.8%
2010 26485
 
4.7%
2005 21394
 
3.8%
2009 20594
 
3.7%
Other values (24) 71003
12.7%
ValueCountFrequency (%)
1982 2
 
< 0.1%
1983 1
 
< 0.1%
1984 5
 
< 0.1%
1985 10
 
< 0.1%
1986 11
 
< 0.1%
1987 8
 
< 0.1%
1988 11
 
< 0.1%
1989 20
 
< 0.1%
1990 49
< 0.1%
1991 67
< 0.1%
ValueCountFrequency (%)
2015 9437
 
1.7%
2014 81070
14.5%
2013 98168
17.6%
2012 102315
18.3%
2011 48548
8.7%
2010 26485
 
4.7%
2009 20594
 
3.7%
2008 31502
 
5.6%
2007 30845
 
5.5%
2006 26913
 
4.8%

make
Text

MISSING 

Distinct96
Distinct (%)< 0.1%
Missing10301
Missing (%)1.8%
Memory size33.3 MiB
2024-09-11T15:32:30.214326image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length13
Median length11
Mean length5.9952236
Min length2

Characters and Unicode

Total characters3288596
Distinct characters49
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique8 ?
Unique (%)< 0.1%

Sample

1st rowKia
2nd rowKia
3rd rowBMW
4th rowVolvo
5th rowBMW
ValueCountFrequency (%)
ford 94001
17.1%
chevrolet 60587
 
11.0%
nissan 54017
 
9.8%
toyota 39966
 
7.3%
dodge 30956
 
5.6%
honda 27351
 
5.0%
hyundai 21837
 
4.0%
bmw 20793
 
3.8%
kia 18084
 
3.3%
chrysler 17485
 
3.2%
Other values (54) 165367
30.0%
2024-09-11T15:32:30.402523image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
o 328819
 
10.0%
e 300106
 
9.1%
a 235580
 
7.2%
r 230200
 
7.0%
d 215895
 
6.6%
n 186317
 
5.7%
i 184908
 
5.6%
s 178494
 
5.4%
t 128322
 
3.9%
l 116781
 
3.6%
Other values (39) 1183174
36.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 3288596
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
o 328819
 
10.0%
e 300106
 
9.1%
a 235580
 
7.2%
r 230200
 
7.0%
d 215895
 
6.6%
n 186317
 
5.7%
i 184908
 
5.6%
s 178494
 
5.4%
t 128322
 
3.9%
l 116781
 
3.6%
Other values (39) 1183174
36.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 3288596
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
o 328819
 
10.0%
e 300106
 
9.1%
a 235580
 
7.2%
r 230200
 
7.0%
d 215895
 
6.6%
n 186317
 
5.7%
i 184908
 
5.6%
s 178494
 
5.4%
t 128322
 
3.9%
l 116781
 
3.6%
Other values (39) 1183174
36.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 3288596
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
o 328819
 
10.0%
e 300106
 
9.1%
a 235580
 
7.2%
r 230200
 
7.0%
d 215895
 
6.6%
n 186317
 
5.7%
i 184908
 
5.6%
s 178494
 
5.4%
t 128322
 
3.9%
l 116781
 
3.6%
Other values (39) 1183174
36.0%

model
Text

MISSING 

Distinct973
Distinct (%)0.2%
Missing10399
Missing (%)1.9%
Memory size33.7 MiB
2024-09-11T15:32:30.615099image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length29
Median length23
Mean length6.7691243
Min length1

Characters and Unicode

Total characters3712445
Distinct characters66
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique72 ?
Unique (%)< 0.1%

Sample

1st rowSorento
2nd rowSorento
3rd row3 Series
4th rowS60
5th row6 Series Gran Coupe
ValueCountFrequency (%)
altima 19432
 
2.9%
series 15429
 
2.3%
grand 14928
 
2.2%
f-150 14527
 
2.2%
1500 14476
 
2.2%
fusion 13639
 
2.0%
camry 13515
 
2.0%
escape 12027
 
1.8%
focus 10463
 
1.6%
g 9333
 
1.4%
Other values (740) 531345
79.4%
2024-09-11T15:32:30.906928image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 376585
 
10.1%
r 277169
 
7.5%
e 269750
 
7.3%
o 195221
 
5.3%
n 184979
 
5.0%
i 170339
 
4.6%
s 149945
 
4.0%
t 136207
 
3.7%
l 132705
 
3.6%
C 123260
 
3.3%
Other values (56) 1696285
45.7%

Most occurring categories

ValueCountFrequency (%)
(unknown) 3712445
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
a 376585
 
10.1%
r 277169
 
7.5%
e 269750
 
7.3%
o 195221
 
5.3%
n 184979
 
5.0%
i 170339
 
4.6%
s 149945
 
4.0%
t 136207
 
3.7%
l 132705
 
3.6%
C 123260
 
3.3%
Other values (56) 1696285
45.7%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 3712445
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
a 376585
 
10.1%
r 277169
 
7.5%
e 269750
 
7.3%
o 195221
 
5.3%
n 184979
 
5.0%
i 170339
 
4.6%
s 149945
 
4.0%
t 136207
 
3.7%
l 132705
 
3.6%
C 123260
 
3.3%
Other values (56) 1696285
45.7%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 3712445
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
a 376585
 
10.1%
r 277169
 
7.5%
e 269750
 
7.3%
o 195221
 
5.3%
n 184979
 
5.0%
i 170339
 
4.6%
s 149945
 
4.0%
t 136207
 
3.7%
l 132705
 
3.6%
C 123260
 
3.3%
Other values (56) 1696285
45.7%

trim
Text

MISSING 

Distinct1963
Distinct (%)0.4%
Missing10715
Missing (%)1.9%
Memory size32.6 MiB
2024-09-11T15:32:31.165307image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length46
Median length37
Mean length4.7365805
Min length1

Characters and Unicode

Total characters2596224
Distinct characters72
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique241 ?
Unique (%)< 0.1%

Sample

1st rowLX
2nd rowLX
3rd row328i SULEV
4th rowT5
5th row650i
ValueCountFrequency (%)
base 56122
 
8.3%
se 48390
 
7.2%
s 30312
 
4.5%
lx 21376
 
3.2%
limited 20582
 
3.1%
lt 20224
 
3.0%
2.5 18864
 
2.8%
xlt 18796
 
2.8%
ls 17932
 
2.7%
sport 17602
 
2.6%
Other values (963) 402154
59.8%
2024-09-11T15:32:31.489770image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
L 214993
 
8.3%
S 206476
 
8.0%
e 155206
 
6.0%
i 135234
 
5.2%
E 127024
 
4.9%
124233
 
4.8%
T 120883
 
4.7%
a 108838
 
4.2%
r 97787
 
3.8%
X 91515
 
3.5%
Other values (62) 1214035
46.8%

Most occurring categories

ValueCountFrequency (%)
(unknown) 2596224
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
L 214993
 
8.3%
S 206476
 
8.0%
e 155206
 
6.0%
i 135234
 
5.2%
E 127024
 
4.9%
124233
 
4.8%
T 120883
 
4.7%
a 108838
 
4.2%
r 97787
 
3.8%
X 91515
 
3.5%
Other values (62) 1214035
46.8%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 2596224
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
L 214993
 
8.3%
S 206476
 
8.0%
e 155206
 
6.0%
i 135234
 
5.2%
E 127024
 
4.9%
124233
 
4.8%
T 120883
 
4.7%
a 108838
 
4.2%
r 97787
 
3.8%
X 91515
 
3.5%
Other values (62) 1214035
46.8%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 2596224
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
L 214993
 
8.3%
S 206476
 
8.0%
e 155206
 
6.0%
i 135234
 
5.2%
E 127024
 
4.9%
124233
 
4.8%
T 120883
 
4.7%
a 108838
 
4.2%
r 97787
 
3.8%
X 91515
 
3.5%
Other values (62) 1214035
46.8%

body
Text

MISSING 

Distinct87
Distinct (%)< 0.1%
Missing13195
Missing (%)2.4%
Memory size32.8 MiB
2024-09-11T15:32:31.593957image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length23
Median length5
Mean length5.2792729
Min length3

Characters and Unicode

Total characters2880593
Distinct characters48
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique5 ?
Unique (%)< 0.1%

Sample

1st rowSUV
2nd rowSUV
3rd rowSedan
4th rowSedan
5th rowSedan
ValueCountFrequency (%)
sedan 248760
42.1%
suv 143844
24.3%
cab 33137
 
5.6%
hatchback 26237
 
4.4%
minivan 25529
 
4.3%
coupe 19983
 
3.4%
crew 16394
 
2.8%
wagon 16180
 
2.7%
convertible 10933
 
1.9%
g 9333
 
1.6%
Other values (33) 40608
 
6.9%
2024-09-11T15:32:31.808682image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 398031
13.8%
e 352374
12.2%
n 338855
11.8%
S 338282
11.7%
d 262219
 
9.1%
V 124796
 
4.3%
U 119292
 
4.1%
C 78559
 
2.7%
b 77463
 
2.7%
s 73869
 
2.6%
Other values (38) 716853
24.9%

Most occurring categories

ValueCountFrequency (%)
(unknown) 2880593
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
a 398031
13.8%
e 352374
12.2%
n 338855
11.8%
S 338282
11.7%
d 262219
 
9.1%
V 124796
 
4.3%
U 119292
 
4.1%
C 78559
 
2.7%
b 77463
 
2.7%
s 73869
 
2.6%
Other values (38) 716853
24.9%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 2880593
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
a 398031
13.8%
e 352374
12.2%
n 338855
11.8%
S 338282
11.7%
d 262219
 
9.1%
V 124796
 
4.3%
U 119292
 
4.1%
C 78559
 
2.7%
b 77463
 
2.7%
s 73869
 
2.6%
Other values (38) 716853
24.9%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 2880593
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
a 398031
13.8%
e 352374
12.2%
n 338855
11.8%
S 338282
11.7%
d 262219
 
9.1%
V 124796
 
4.3%
U 119292
 
4.1%
C 78559
 
2.7%
b 77463
 
2.7%
s 73869
 
2.6%
Other values (38) 716853
24.9%

transmission
Categorical

IMBALANCE  MISSING 

Distinct5
Distinct (%)< 0.1%
Missing65321
Missing (%)11.7%
Memory size35.0 MiB
automatic
475677 
manual
 
17539
horse-driven
 
274
sedan
 
15
Sedan
 
11

Length

Max length12
Median length9
Mean length8.8948383
Min length5

Characters and Unicode

Total characters4389745
Distinct characters17
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowautomatic
2nd rowautomatic
3rd rowautomatic
4th rowautomatic
5th rowautomatic

Common Values

ValueCountFrequency (%)
automatic 475677
85.1%
manual 17539
 
3.1%
horse-driven 274
 
< 0.1%
sedan 15
 
< 0.1%
Sedan 11
 
< 0.1%
(Missing) 65321
 
11.7%

Length

2024-09-11T15:32:31.879185image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-09-11T15:32:31.931183image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
ValueCountFrequency (%)
automatic 475677
96.4%
manual 17539
 
3.6%
horse-driven 274
 
0.1%
sedan 26
 
< 0.1%

Most occurring characters

ValueCountFrequency (%)
a 986458
22.5%
t 951354
21.7%
u 493216
11.2%
m 493216
11.2%
o 475951
10.8%
i 475951
10.8%
c 475677
10.8%
n 17839
 
0.4%
l 17539
 
0.4%
e 574
 
< 0.1%
Other values (7) 1970
 
< 0.1%

Most occurring categories

ValueCountFrequency (%)
(unknown) 4389745
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
a 986458
22.5%
t 951354
21.7%
u 493216
11.2%
m 493216
11.2%
o 475951
10.8%
i 475951
10.8%
c 475677
10.8%
n 17839
 
0.4%
l 17539
 
0.4%
e 574
 
< 0.1%
Other values (7) 1970
 
< 0.1%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 4389745
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
a 986458
22.5%
t 951354
21.7%
u 493216
11.2%
m 493216
11.2%
o 475951
10.8%
i 475951
10.8%
c 475677
10.8%
n 17839
 
0.4%
l 17539
 
0.4%
e 574
 
< 0.1%
Other values (7) 1970
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 4389745
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
a 986458
22.5%
t 951354
21.7%
u 493216
11.2%
m 493216
11.2%
o 475951
10.8%
i 475951
10.8%
c 475677
10.8%
n 17839
 
0.4%
l 17539
 
0.4%
e 574
 
< 0.1%
Other values (7) 1970
 
< 0.1%

vin
Text

Distinct550297
Distinct (%)98.5%
Missing4
Missing (%)< 0.1%
Memory size39.4 MiB
2024-09-11T15:32:32.194857image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length17
Median length17
Mean length16.999585
Min length3

Characters and Unicode

Total characters9499929
Distinct characters36
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique541970 ?
Unique (%)97.0%

Sample

1st row5xyktca69fg566472
2nd row5xyktca69fg561319
3rd rowwba3c1c51ek116351
4th rowyv1612tb4f1310987
5th rowwba6b2c57ed129731
ValueCountFrequency (%)
automatic 22
 
< 0.1%
wbanv13588cz57827 5
 
< 0.1%
wp0ca2988xu629622 4
 
< 0.1%
1ftfw1cv5afb30053 4
 
< 0.1%
wddgf56x78f009940 4
 
< 0.1%
5uxfe43579l274932 4
 
< 0.1%
5n1ar1nn2bc632869 4
 
< 0.1%
trusc28n241022003 4
 
< 0.1%
1hgcp3f8xca021624 3
 
< 0.1%
1n6aa06b06n500808 3
 
< 0.1%
Other values (550287) 558776
> 99.9%
2024-09-11T15:32:32.536957image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 919854
 
9.7%
2 636863
 
6.7%
3 612467
 
6.4%
5 595556
 
6.3%
4 574940
 
6.1%
0 498895
 
5.3%
6 487519
 
5.1%
7 458863
 
4.8%
8 455044
 
4.8%
c 381530
 
4.0%
Other values (26) 3878398
40.8%

Most occurring categories

ValueCountFrequency (%)
(unknown) 9499929
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
1 919854
 
9.7%
2 636863
 
6.7%
3 612467
 
6.4%
5 595556
 
6.3%
4 574940
 
6.1%
0 498895
 
5.3%
6 487519
 
5.1%
7 458863
 
4.8%
8 455044
 
4.8%
c 381530
 
4.0%
Other values (26) 3878398
40.8%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 9499929
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
1 919854
 
9.7%
2 636863
 
6.7%
3 612467
 
6.4%
5 595556
 
6.3%
4 574940
 
6.1%
0 498895
 
5.3%
6 487519
 
5.1%
7 458863
 
4.8%
8 455044
 
4.8%
c 381530
 
4.0%
Other values (26) 3878398
40.8%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 9499929
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
1 919854
 
9.7%
2 636863
 
6.7%
3 612467
 
6.4%
5 595556
 
6.3%
4 574940
 
6.1%
0 498895
 
5.3%
6 487519
 
5.1%
7 458863
 
4.8%
8 455044
 
4.8%
c 381530
 
4.0%
Other values (26) 3878398
40.8%

state
Text

Distinct64
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size31.4 MiB
2024-09-11T15:32:32.657217image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length17
Median length2
Mean length2.0006979
Min length2

Characters and Unicode

Total characters1118064
Distinct characters36
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)< 0.1%

Sample

1st rowca
2nd rowca
3rd rowca
4th rowca
5th rowca
ValueCountFrequency (%)
fl 82945
14.8%
ca 73148
13.1%
pa 53907
 
9.6%
tx 45913
 
8.2%
ga 34750
 
6.2%
nj 27784
 
5.0%
il 23486
 
4.2%
nc 21845
 
3.9%
oh 21575
 
3.9%
tn 20895
 
3.7%
Other values (54) 152589
27.3%
2024-09-11T15:32:32.834553image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
a 199889
17.9%
n 110349
9.9%
l 108648
9.7%
c 108264
9.7%
f 82971
 
7.4%
t 68644
 
6.1%
m 60888
 
5.4%
p 56632
 
5.1%
i 54410
 
4.9%
o 50032
 
4.5%
Other values (26) 217337
19.4%

Most occurring categories

ValueCountFrequency (%)
(unknown) 1118064
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
a 199889
17.9%
n 110349
9.9%
l 108648
9.7%
c 108264
9.7%
f 82971
 
7.4%
t 68644
 
6.1%
m 60888
 
5.4%
p 56632
 
5.1%
i 54410
 
4.9%
o 50032
 
4.5%
Other values (26) 217337
19.4%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 1118064
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
a 199889
17.9%
n 110349
9.9%
l 108648
9.7%
c 108264
9.7%
f 82971
 
7.4%
t 68644
 
6.1%
m 60888
 
5.4%
p 56632
 
5.1%
i 54410
 
4.9%
o 50032
 
4.5%
Other values (26) 217337
19.4%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 1118064
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
a 199889
17.9%
n 110349
9.9%
l 108648
9.7%
c 108264
9.7%
f 82971
 
7.4%
t 68644
 
6.1%
m 60888
 
5.4%
p 56632
 
5.1%
i 54410
 
4.9%
o 50032
 
4.5%
Other values (26) 217337
19.4%

condition
Real number (ℝ)

MISSING 

Distinct49
Distinct (%)< 0.1%
Missing11820
Missing (%)2.1%
Infinite0
Infinite (%)0.0%
Mean30.680052
Minimum0
Maximum982
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size4.3 MiB
2024-09-11T15:32:32.906553image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile2
Q123
median35
Q342
95-th percentile47
Maximum982
Range982
Interquartile range (IQR)19

Descriptive statistics

Standard deviation13.585974
Coefficient of variation (CV)0.4428276
Kurtosis99.271038
Mean30.680052
Median Absolute Deviation (MAD)8
Skewness0.77213291
Sum16782510
Variance184.57868
MonotonicityNot monotonic
2024-09-11T15:32:32.975560image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=49)
ValueCountFrequency (%)
19 42280
 
7.6%
35 26749
 
4.8%
37 25938
 
4.6%
44 25514
 
4.6%
43 24937
 
4.5%
42 24328
 
4.4%
36 23144
 
4.1%
41 23073
 
4.1%
2 20788
 
3.7%
4 19922
 
3.6%
Other values (39) 290344
52.0%
ValueCountFrequency (%)
0 1
 
< 0.1%
1 7363
 
1.3%
2 20788
3.7%
3 10802
1.9%
4 19922
3.6%
5 11222
2.0%
11 87
 
< 0.1%
12 95
 
< 0.1%
13 82
 
< 0.1%
14 134
 
< 0.1%
ValueCountFrequency (%)
982 1
 
< 0.1%
897 1
 
< 0.1%
849 1
 
< 0.1%
313 2
 
< 0.1%
289 2
 
< 0.1%
280 1
 
< 0.1%
193 1
 
< 0.1%
49 13099
2.3%
48 12712
2.3%
47 11362
2.0%

odometer
Real number (ℝ)

Distinct172276
Distinct (%)30.8%
Missing94
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean68311.538
Minimum0
Maximum999999
Zeros152
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size4.3 MiB
2024-09-11T15:32:33.044560image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile10491.1
Q128359
median52247
Q399108.5
95-th percentile170056.9
Maximum999999
Range999999
Interquartile range (IQR)70749.5

Descriptive statistics

Standard deviation53405.247
Coefficient of variation (CV)0.78178956
Kurtosis13.542316
Mean68311.538
Median Absolute Deviation (MAD)30480
Skewness1.8426065
Sum3.8168594 × 1010
Variance2.8521204 × 109
MonotonicityNot monotonic
2024-09-11T15:32:33.116552image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1318
 
0.2%
0 152
 
< 0.1%
999999 72
 
< 0.1%
10 29
 
< 0.1%
21587 21
 
< 0.1%
21310 18
 
< 0.1%
29137 18
 
< 0.1%
8 18
 
< 0.1%
2 18
 
< 0.1%
27023 17
 
< 0.1%
Other values (172266) 557062
99.7%
(Missing) 94
 
< 0.1%
ValueCountFrequency (%)
0 152
 
< 0.1%
1 1318
0.2%
2 18
 
< 0.1%
3 9
 
< 0.1%
4 9
 
< 0.1%
5 17
 
< 0.1%
6 13
 
< 0.1%
7 13
 
< 0.1%
8 18
 
< 0.1%
9 11
 
< 0.1%
ValueCountFrequency (%)
999999 72
< 0.1%
980113 1
 
< 0.1%
959276 1
 
< 0.1%
694978 2
 
< 0.1%
621388 1
 
< 0.1%
580956 1
 
< 0.1%
537334 1
 
< 0.1%
522212 1
 
< 0.1%
500227 1
 
< 0.1%
495757 1
 
< 0.1%

color
Categorical

Distinct46
Distinct (%)< 0.1%
Missing749
Missing (%)0.1%
Memory size33.6 MiB
black
110970 
white
106673 
silver
83389 
gray
82857 
blue
51139 
Other values (41)
123060 

Length

Max length9
Median length8
Mean length4.6275032
Min length1

Characters and Unicode

Total characters2582554
Distinct characters35
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique26 ?
Unique (%)< 0.1%

Sample

1st rowwhite
2nd rowwhite
3rd rowgray
4th rowwhite
5th rowgray

Common Values

ValueCountFrequency (%)
black 110970
19.9%
white 106673
19.1%
silver 83389
14.9%
gray 82857
14.8%
blue 51139
9.2%
red 43569
 
7.8%
— 24685
 
4.4%
green 11382
 
2.0%
gold 11342
 
2.0%
beige 9222
 
1.7%
Other values (36) 22860
 
4.1%

Length

2024-09-11T15:32:33.184560image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
black 110970
19.9%
white 106673
19.1%
silver 83389
14.9%
gray 82857
14.8%
blue 51139
9.2%
red 43569
 
7.8%
— 24685
 
4.4%
green 11382
 
2.0%
gold 11342
 
2.0%
beige 9222
 
1.7%
Other values (36) 22860
 
4.1%

Most occurring characters

ValueCountFrequency (%)
e 332602
12.9%
l 261465
 
10.1%
r 241240
 
9.3%
i 201026
 
7.8%
a 196863
 
7.6%
b 187020
 
7.2%
g 125853
 
4.9%
w 116124
 
4.5%
c 111928
 
4.3%
k 111012
 
4.3%
Other values (25) 697421
27.0%

Most occurring categories

ValueCountFrequency (%)
(unknown) 2582554
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
e 332602
12.9%
l 261465
 
10.1%
r 241240
 
9.3%
i 201026
 
7.8%
a 196863
 
7.6%
b 187020
 
7.2%
g 125853
 
4.9%
w 116124
 
4.5%
c 111928
 
4.3%
k 111012
 
4.3%
Other values (25) 697421
27.0%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 2582554
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
e 332602
12.9%
l 261465
 
10.1%
r 241240
 
9.3%
i 201026
 
7.8%
a 196863
 
7.6%
b 187020
 
7.2%
g 125853
 
4.9%
w 116124
 
4.5%
c 111928
 
4.3%
k 111012
 
4.3%
Other values (25) 697421
27.0%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 2582554
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
e 332602
12.9%
l 261465
 
10.1%
r 241240
 
9.3%
i 201026
 
7.8%
a 196863
 
7.6%
b 187020
 
7.2%
g 125853
 
4.9%
w 116124
 
4.5%
c 111928
 
4.3%
k 111012
 
4.3%
Other values (25) 697421
27.0%

interior
Categorical

IMBALANCE 

Distinct17
Distinct (%)< 0.1%
Missing749
Missing (%)0.1%
Memory size33.2 MiB
black
244329 
gray
178581 
beige
59758 
tan
44093 
—
 
17077
Other values (12)
 
14250

Length

Max length9
Median length5
Mean length4.399437
Min length1

Characters and Unicode

Total characters2455273
Distinct characters23
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowblack
2nd rowbeige
3rd rowblack
4th rowblack
5th rowblack

Common Values

ValueCountFrequency (%)
black 244329
43.7%
gray 178581
32.0%
beige 59758
 
10.7%
tan 44093
 
7.9%
— 17077
 
3.1%
brown 8640
 
1.5%
red 1363
 
0.2%
blue 1143
 
0.2%
silver 1104
 
0.2%
off-white 480
 
0.1%
Other values (7) 1520
 
0.3%
(Missing) 749
 
0.1%

Length

2024-09-11T15:32:33.251560image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
black 244329
43.8%
gray 178581
32.0%
beige 59758
 
10.7%
tan 44093
 
7.9%
— 17077
 
3.1%
brown 8640
 
1.5%
red 1363
 
0.2%
blue 1143
 
0.2%
silver 1104
 
0.2%
off-white 480
 
0.1%
Other values (7) 1520
 
0.3%

Most occurring characters

ValueCountFrequency (%)
a 467148
19.0%
b 314061
12.8%
l 247279
10.1%
c 244329
10.0%
k 244329
10.0%
g 239244
9.7%
r 190608
7.8%
y 178792
 
7.3%
e 124856
 
5.1%
i 61598
 
2.5%
Other values (13) 143029
 
5.8%

Most occurring categories

ValueCountFrequency (%)
(unknown) 2455273
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
a 467148
19.0%
b 314061
12.8%
l 247279
10.1%
c 244329
10.0%
k 244329
10.0%
g 239244
9.7%
r 190608
7.8%
y 178792
 
7.3%
e 124856
 
5.1%
i 61598
 
2.5%
Other values (13) 143029
 
5.8%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 2455273
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
a 467148
19.0%
b 314061
12.8%
l 247279
10.1%
c 244329
10.0%
k 244329
10.0%
g 239244
9.7%
r 190608
7.8%
y 178792
 
7.3%
e 124856
 
5.1%
i 61598
 
2.5%
Other values (13) 143029
 
5.8%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 2455273
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
a 467148
19.0%
b 314061
12.8%
l 247279
10.1%
c 244329
10.0%
k 244329
10.0%
g 239244
9.7%
r 190608
7.8%
y 178792
 
7.3%
e 124856
 
5.1%
i 61598
 
2.5%
Other values (13) 143029
 
5.8%

seller
Text

Distinct14264
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Memory size42.6 MiB
2024-09-11T15:32:33.484813image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length50
Median length42
Mean length22.990081
Min length3

Characters and Unicode

Total characters12847708
Distinct characters47
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique4949 ?
Unique (%)0.9%

Sample

1st rowkia motors america inc
2nd rowkia motors america inc
3rd rowfinancial services remarketing (lease)
4th rowvolvo na rep/world omni
5th rowfinancial services remarketing (lease)
ValueCountFrequency (%)
inc 86909
 
4.6%
services 48240
 
2.6%
corporation 47850
 
2.5%
auto 47452
 
2.5%
credit 46959
 
2.5%
motor 45807
 
2.4%
llc 45554
 
2.4%
financial 44150
 
2.3%
ford 36212
 
1.9%
remarketing 35475
 
1.9%
Other values (8581) 1395305
74.2%
2024-09-11T15:32:33.794814image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1339787
 
10.4%
e 1146331
 
8.9%
a 1052355
 
8.2%
r 962265
 
7.5%
n 953446
 
7.4%
i 917296
 
7.1%
o 863200
 
6.7%
t 796230
 
6.2%
c 736349
 
5.7%
s 671368
 
5.2%
Other values (37) 3409081
26.5%

Most occurring categories

ValueCountFrequency (%)
(unknown) 12847708
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
1339787
 
10.4%
e 1146331
 
8.9%
a 1052355
 
8.2%
r 962265
 
7.5%
n 953446
 
7.4%
i 917296
 
7.1%
o 863200
 
6.7%
t 796230
 
6.2%
c 736349
 
5.7%
s 671368
 
5.2%
Other values (37) 3409081
26.5%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 12847708
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
1339787
 
10.4%
e 1146331
 
8.9%
a 1052355
 
8.2%
r 962265
 
7.5%
n 953446
 
7.4%
i 917296
 
7.1%
o 863200
 
6.7%
t 796230
 
6.2%
c 736349
 
5.7%
s 671368
 
5.2%
Other values (37) 3409081
26.5%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 12847708
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
1339787
 
10.4%
e 1146331
 
8.9%
a 1052355
 
8.2%
r 962265
 
7.5%
n 953446
 
7.4%
i 917296
 
7.1%
o 863200
 
6.7%
t 796230
 
6.2%
c 736349
 
5.7%
s 671368
 
5.2%
Other values (37) 3409081
26.5%

mmr
Real number (ℝ)

Distinct1101
Distinct (%)0.2%
Missing123
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean13770.378
Minimum25
Maximum182000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.3 MiB
2024-09-11T15:32:33.867813image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum25
5-th percentile1800
Q17100
median12250
Q318300
95-th percentile30600
Maximum182000
Range181975
Interquartile range (IQR)11200

Descriptive statistics

Standard deviation9680.2193
Coefficient of variation (CV)0.70297414
Kurtosis11.443099
Mean13770.378
Median Absolute Deviation (MAD)5575
Skewness1.997564
Sum7.6937027 × 109
Variance93706645
MonotonicityNot monotonic
2024-09-11T15:32:33.937306image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
12500 1760
 
0.3%
11600 1751
 
0.3%
11650 1746
 
0.3%
12150 1722
 
0.3%
11850 1716
 
0.3%
11300 1716
 
0.3%
11750 1709
 
0.3%
12350 1702
 
0.3%
12700 1701
 
0.3%
11950 1694
 
0.3%
Other values (1091) 541497
96.9%
ValueCountFrequency (%)
25 30
 
< 0.1%
50 44
< 0.1%
75 23
 
< 0.1%
100 33
 
< 0.1%
125 40
< 0.1%
150 45
< 0.1%
175 69
< 0.1%
200 54
< 0.1%
225 60
< 0.1%
250 83
< 0.1%
ValueCountFrequency (%)
182000 1
 
< 0.1%
178000 1
 
< 0.1%
176000 1
 
< 0.1%
172000 1
 
< 0.1%
170000 3
< 0.1%
166000 3
< 0.1%
164000 1
 
< 0.1%
163000 1
 
< 0.1%
162000 1
 
< 0.1%
161000 1
 
< 0.1%

sellingprice
Real number (ℝ)

Distinct1887
Distinct (%)0.3%
Missing12
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean13611.359
Minimum1
Maximum230000
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.3 MiB
2024-09-11T15:32:34.007806image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1500
Q16900
median12100
Q318200
95-th percentile30600
Maximum230000
Range229999
Interquartile range (IQR)11300

Descriptive statistics

Standard deviation9749.5016
Coefficient of variation (CV)0.71627688
Kurtosis11.114646
Mean13611.359
Median Absolute Deviation (MAD)5650
Skewness1.9534444
Sum7.6063676 × 109
Variance95052782
MonotonicityNot monotonic
2024-09-11T15:32:34.147207image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
11000 4453
 
0.8%
12000 4450
 
0.8%
13000 4334
 
0.8%
10000 4029
 
0.7%
14000 3899
 
0.7%
11500 3876
 
0.7%
12500 3714
 
0.7%
9000 3689
 
0.7%
10500 3540
 
0.6%
15000 3386
 
0.6%
Other values (1877) 519455
93.0%
ValueCountFrequency (%)
1 4
 
< 0.1%
100 19
 
< 0.1%
125 1
 
< 0.1%
150 21
 
< 0.1%
175 10
 
< 0.1%
200 196
 
< 0.1%
225 105
 
< 0.1%
250 281
 
0.1%
275 124
 
< 0.1%
300 1282
0.2%
ValueCountFrequency (%)
230000 1
< 0.1%
183000 1
< 0.1%
173000 1
< 0.1%
171500 1
< 0.1%
169500 1
< 0.1%
169000 1
< 0.1%
167000 1
< 0.1%
165000 2
< 0.1%
163000 2
< 0.1%
161000 1
< 0.1%
Distinct3766
Distinct (%)0.7%
Missing12
Missing (%)< 0.1%
Memory size51.2 MiB
2024-09-11T15:32:34.378673image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Length

Max length39
Median length39
Mean length38.998409
Min length4

Characters and Unicode

Total characters21793286
Distinct characters40
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique604 ?
Unique (%)0.1%

Sample

1st rowTue Dec 16 2014 12:30:00 GMT-0800 (PST)
2nd rowTue Dec 16 2014 12:30:00 GMT-0800 (PST)
3rd rowThu Jan 15 2015 04:30:00 GMT-0800 (PST)
4th rowThu Jan 29 2015 04:30:00 GMT-0800 (PST)
5th rowThu Dec 18 2014 12:30:00 GMT-0800 (PST)
ValueCountFrequency (%)
2015 505072
 
12.9%
gmt-0800 395489
 
10.1%
pst 395489
 
10.1%
wed 166069
 
4.2%
tue 163950
 
4.2%
pdt 163310
 
4.2%
gmt-0700 163310
 
4.2%
feb 163053
 
4.2%
thu 153750
 
3.9%
jan 140815
 
3.6%
Other values (334) 1501312
38.4%
2024-09-11T15:32:34.673564image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 4819691
22.1%
3352794
15.4%
T 1435298
 
6.6%
: 1117598
 
5.1%
1 1061408
 
4.9%
2 962306
 
4.4%
M 673292
 
3.1%
5 660463
 
3.0%
G 558799
 
2.6%
) 558799
 
2.6%
Other values (30) 6592838
30.3%

Most occurring categories

ValueCountFrequency (%)
(unknown) 21793286
100.0%

Most frequent character per category

(unknown)
ValueCountFrequency (%)
0 4819691
22.1%
3352794
15.4%
T 1435298
 
6.6%
: 1117598
 
5.1%
1 1061408
 
4.9%
2 962306
 
4.4%
M 673292
 
3.1%
5 660463
 
3.0%
G 558799
 
2.6%
) 558799
 
2.6%
Other values (30) 6592838
30.3%

Most occurring scripts

ValueCountFrequency (%)
(unknown) 21793286
100.0%

Most frequent character per script

(unknown)
ValueCountFrequency (%)
0 4819691
22.1%
3352794
15.4%
T 1435298
 
6.6%
: 1117598
 
5.1%
1 1061408
 
4.9%
2 962306
 
4.4%
M 673292
 
3.1%
5 660463
 
3.0%
G 558799
 
2.6%
) 558799
 
2.6%
Other values (30) 6592838
30.3%

Most occurring blocks

ValueCountFrequency (%)
(unknown) 21793286
100.0%

Most frequent character per block

(unknown)
ValueCountFrequency (%)
0 4819691
22.1%
3352794
15.4%
T 1435298
 
6.6%
: 1117598
 
5.1%
1 1061408
 
4.9%
2 962306
 
4.4%
M 673292
 
3.1%
5 660463
 
3.0%
G 558799
 
2.6%
) 558799
 
2.6%
Other values (30) 6592838
30.3%

Interactions

2024-09-11T15:32:27.719080image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-09-11T15:32:25.847152image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-09-11T15:32:26.317520image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-09-11T15:32:26.766593image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-09-11T15:32:27.258482image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-09-11T15:32:27.811968image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-09-11T15:32:25.946653image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-09-11T15:32:26.408492image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-09-11T15:32:26.856968image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-09-11T15:32:27.350987image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-09-11T15:32:27.901968image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-09-11T15:32:26.039153image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-09-11T15:32:26.497209image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-09-11T15:32:26.942294image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-09-11T15:32:27.441132image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-09-11T15:32:27.992998image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-09-11T15:32:26.132653image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-09-11T15:32:26.586166image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-09-11T15:32:27.077467image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-09-11T15:32:27.531052image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-09-11T15:32:28.082003image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-09-11T15:32:26.228153image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-09-11T15:32:26.677835image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-09-11T15:32:27.169866image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
2024-09-11T15:32:27.625580image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/

Missing values

2024-09-11T15:32:28.240994image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
A simple visualization of nullity by column.
2024-09-11T15:32:28.645951image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
2024-09-11T15:32:29.534100image/svg+xmlMatplotlib v3.8.4, https://matplotlib.org/
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.

Sample

yearmakemodeltrimbodytransmissionvinstateconditionodometercolorinteriorsellermmrsellingpricesaledate
02015KiaSorentoLXSUVautomatic5xyktca69fg566472ca5.016639.0whiteblackkia motors america inc20500.021500.0Tue Dec 16 2014 12:30:00 GMT-0800 (PST)
12015KiaSorentoLXSUVautomatic5xyktca69fg561319ca5.09393.0whitebeigekia motors america inc20800.021500.0Tue Dec 16 2014 12:30:00 GMT-0800 (PST)
22014BMW3 Series328i SULEVSedanautomaticwba3c1c51ek116351ca45.01331.0grayblackfinancial services remarketing (lease)31900.030000.0Thu Jan 15 2015 04:30:00 GMT-0800 (PST)
32015VolvoS60T5Sedanautomaticyv1612tb4f1310987ca41.014282.0whiteblackvolvo na rep/world omni27500.027750.0Thu Jan 29 2015 04:30:00 GMT-0800 (PST)
42014BMW6 Series Gran Coupe650iSedanautomaticwba6b2c57ed129731ca43.02641.0grayblackfinancial services remarketing (lease)66000.067000.0Thu Dec 18 2014 12:30:00 GMT-0800 (PST)
52015NissanAltima2.5 SSedanautomatic1n4al3ap1fn326013ca1.05554.0grayblackenterprise vehicle exchange / tra / rental / tulsa15350.010900.0Tue Dec 30 2014 12:00:00 GMT-0800 (PST)
62014BMWM5BaseSedanautomaticwbsfv9c51ed593089ca34.014943.0blackblackthe hertz corporation69000.065000.0Wed Dec 17 2014 12:30:00 GMT-0800 (PST)
72014ChevroletCruze1LTSedanautomatic1g1pc5sb2e7128460ca2.028617.0blackblackenterprise vehicle exchange / tra / rental / tulsa11900.09800.0Tue Dec 16 2014 13:00:00 GMT-0800 (PST)
82014AudiA42.0T Premium Plus quattroSedanautomaticwauffafl3en030343ca42.09557.0whiteblackaudi mission viejo32100.032250.0Thu Dec 18 2014 12:00:00 GMT-0800 (PST)
92014ChevroletCamaroLTConvertibleautomatic2g1fb3d37e9218789ca3.04809.0redblackd/m auto sales inc26300.017500.0Tue Jan 20 2015 04:00:00 GMT-0800 (PST)
yearmakemodeltrimbodytransmissionvinstateconditionodometercolorinteriorsellermmrsellingpricesaledate
5588272014JeepGrand CherokeeLaredoSUVautomatic1c4rjfag0ec466276pa42.025180.0grayblackhertz corporation/gdp26000.024500.0Tue Jul 07 2015 06:30:00 GMT-0700 (PDT)
5588282012DodgeGrand CaravanAmerican Value PackageMinivanautomatic2c4rdgbg1cr349287ma37.097036.0silvergrayge fleet services for itself/servicer8300.07800.0Tue Jul 07 2015 06:30:00 GMT-0700 (PDT)
5588292012HyundaiElantraLimitedSedanNaN5npdh4ae7ch106397pa4.066720.0graygraychampion mazda10250.010400.0Wed Jul 08 2015 07:30:00 GMT-0700 (PDT)
5588302012NissanSentra2.0 SRSedanNaN3n1ab6ap3cl622485tn26.035858.0whitegraynissan-infiniti lt9950.010400.0Wed Jul 08 2015 17:15:00 GMT-0700 (PDT)
5588312011BMW5 Series528iSedanautomaticwbafr1c53bc744672fl39.066403.0whitebrownlauderdale imports ltd bmw pembrok pines20300.022800.0Tue Jul 07 2015 06:15:00 GMT-0700 (PDT)
5588322015KiaK900LuxurySedanNaNknalw4d4xf6019304in45.018255.0silverblackavis corporation35300.033000.0Thu Jul 09 2015 07:00:00 GMT-0700 (PDT)
5588332012Ram2500Power WagonCrew Cabautomatic3c6td5et6cg112407wa5.054393.0whiteblacki -5 uhlmann rv30200.030800.0Wed Jul 08 2015 09:30:00 GMT-0700 (PDT)
5588342012BMWX5xDrive35dSUVautomatic5uxzw0c58cl668465ca48.050561.0blackblackfinancial services remarketing (lease)29800.034000.0Wed Jul 08 2015 09:30:00 GMT-0700 (PDT)
5588352015NissanAltima2.5 Ssedanautomatic1n4al3ap0fc216050ga38.016658.0whiteblackenterprise vehicle exchange / tra / rental / tulsa15100.011100.0Thu Jul 09 2015 06:45:00 GMT-0700 (PDT)
5588362014FordF-150XLTSuperCrewautomatic1ftfw1et2eke87277ca34.015008.0graygrayford motor credit company llc pd29600.026700.0Thu May 28 2015 05:30:00 GMT-0700 (PDT)